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Abstract. In this paper we present a method for reformulating the 
Recommender Systems problem in an Information Retrieval one. In our 
tests we have a dataset of users who give ratings for some movies; we 
hide some values from the dataset, and we try to predict them again 
using its remaining portion (the so-called "leave- n-out approach"). 
In order to use an Information Retrieval algorithm, we reformulate this 
Recommender Systems problem in this way: a user corresponds to a 
document, a movie corresponds to a term, the active user (whose rating 
we want to predict) plays the role of the query, and the ratings are used 
as weigths, in place of the weighting schema of the original IR algorithm. 
The output is the ranking list of the documents (" users") relevant for 
the query ("active user"). We use the ratings of these users, weighted 
according to the rank, to predict the rating of the active user. We carry 
out the comparison by means of a typical metric, namely the accuracy 
of the predictions returned by the algorithm, and we compare this to the 
real ratings from users. In our first tests, we use two different Information 
Retrieval algorithms: LSPR, a recently proposed model based on Discrete 
Fourier Transform, and a simple vector space model. 
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1 Introduction 

Recommender Systems (RS) have become an important research area and, as a 
consequence, many empirical studies appeared in recent years. Many methods 
have been proposed by different, important research groups and, in particular, 
Collaborative Filtering [1, 2, 3, 4] gained a great popularity and it is nowadays 
a well known approach. It was remarked [5] that collaborative filtering shares 
fundamental aspects with Information Retrieval (IR) and there is somehow a 
continuity between these two fields of research. This work belongs to this stream. 
In fact we are developing a RS which makes use of concepts and tools used 
elsewhere in an IR context and, believing that the underlying structure could also 
provide an interesting framework for RS algorithms, we looked for experimental 
evidence of this intuition. 
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In our approach first we move from the Recommender Systems domain to 
the Information Retrieval one. To do this, we consider each user as a document 
and each movie as a term (even if we can use this approach not only for the 
movies): in this way, as in IR a document is a set of terms, in the RS field a 
user becomes as a set of movies (for which the user has given a rating). Using 
this representation, the ratings of the users are the baseline for computing the 
weights of the terms, as explained in section 3. 

Moreover, the active user becomes the query in this phase; the meaning of 
this is that in Information Retrieval we want the documents more similar to the 
query, and for the RS problem we want the users more similar to the active user. 

At this point we can use one of the several existing IR algorithms to obtain 
the ranking list, that represents the set of users more similar to the active user, 
ordered by decreasing similarity. 

Finally, as explained at the end of section 3, we use the ranking list to get the 
prcdicition for the active user; this last step brings us again in the Recommender 
Systems domain. 

The evaluation of most works in this field is carried out using "artificial" 
datasets provided by well know research groups, such as GroupLens [6, 7], or by 
Nctflix (http : //www . netf lixprize . com) [8, 9] . This approach ensures somehow 
a standard method to evaluate results. Hence, we have implemented our in-house 
algorithm using both Least Spectral Power Ranking (LSPR) model, presented 
in [10], and an algorithm based on vector space model, as conceived in the 
1960s by Salton [11, 12]; we tested it by means of a standard dataset provided 
by GroupLens. Basically, we have compared it with the "community" which 
constitutes the benchmark to overcome, in order to show the feasibility of the 
approach. 

This paper is organised as follows. In section 2 we introduce the problem 
in a more formal way, and in section 3 we describe the algorithm itself. After 
that, in section 4 we report the experimental results obtained when running this 
algorithm. Finally, in section 5 we discuss the results. 

2 Description of the problem 

We can formulate our problem as follows. We have: 

— a set U of users, \U\ = n; 

— a set / of items (movies, songs, restaurants...), |/| = m; 

— a gain function G which expresses the utility of an item for a user 

Utility is expressed by a numeric value representing a rating (the higher the 
better) varying on a chosen interval V, more formally the function G is defined 
as: 

G: Uxl V 

We want to maximize the users' utility by recommending good items and ad- 
vising against bad ones. The problem is that we do not know "a priori" all the 
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values of G, hence we have to predict users' ratings. This ability is normally 
tested in an almost empirical way showing that the system is able to predict a 
set of known ratings. 

In this paper we use a dataset provided by GroupLcns. Basically there are 
100 000 ratings (from 1 to 5) given by n — 943 users on m = 1682 movies 
and each user rated at least 20 movies. We represent this dataset with a matrix 
£) mxn ; where Dij is the rating given by the user j for the movie i (0 value is used 
if no rating is available) . Our aim is to predict the rating of a user (called "active 
user") for each movie (using the informations of the matrix D), minimizing the 
differences between the predicted ratings and the real ones. 



3 Model 

This section describes the core concepts of our framework, where we use both 
the LSPR model and the vector space model as IR algorithms. 

Basically in the LSPR model the query is viewed as a spectrum and each 
document as a set of filters, with one filter for each document term, whereas 
the vector space model views terms as basis vectors, documents and queries as 
vectors of the same space. 

Usually in Information Retrieval some weighting schemes are used for the 
terms of the documents and for the terms in the query; the basic choice is to 
use the TF-IDF weighting schema for the former, and the IDF weighting schema 
for the latter. In order to use the IR algorithms for the Recommender Systems, 
it is necessary to modify these weighting functions. Since each user becomes a 
document, and each movie becomes a term, there is a similarity between the 
matrix D and the well-known term-document matrix. At this point, consider 
an active user k e U, for which we want to predict the rating for the movie 
h E I. Starting from D, we compute a new matrix WU (that plays the role of 
the normalized TF-IDF weights matrix in Information Retrieval) as follows: 

uztt 1° if D« • Afc = 

W « = (l_J^ otherwise W 

This means that the more the rating of a user for a movie is similar to the rating 
of the active user, the more its weight (from to 1). 

The column k in this matrix is not considered, because it is for the movies 
not rated by the active user, 1 otherwise. 

After that, we compute the weights for the active user (i.e. the IDF weights 
for the query); we save these informations in the column k of the matrix WU, 
using the following formula: 

f if rii = OR D lk = 

WU ik = { /„\ t , . (2) 
[log 2 (^J otherwise, 

where rn is the number of users that have rated the movie i G I. 
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Now we are ready to use the Information Retrieval algorithm: the query is 
represented by the column k of WU, while the documents of the collection are 
the columns j ^ k of the same matrix with WUhj ^ 0. The output of the model 
is the ranking list of the documents, ordered by increasing relevance. This means 
that the collection is the set of users that have rated the movie h, and the output 
is the same set of users ordered from the more to the less "similar" to the active 
user. 

The last operation is to predict the rating. To do this, we use the ratings of 
the users in the ranking list, weighted by their rank, so that the smaller the rank 
of the user is, the more his rating is considered. Suppose the ranking is given by 
the list of users R, where \R\ is the number of retrieved users, the rank of each 
user is from (first) to \R\ — 1 (last), and Dhj( r ) is the rating for the movie h 
of the user with rank r. The predicted rating is computed as: 



Figure 1 summarizes the algorithm. Basically, the rows from 1 to 14 represent 
the operation described by equation (1), while the rows from 15 to 23 implement 
the equation (2). Finally there is the call to the Information Retrieval algorithm, 
and the prediction of the rating, according to equations (3) and (4). 

4 Evaluation of the algorithm 

Many different measures are used in order to evaluate the performance of filtering 
algorithms employed by Rccommcnder Systems and some metrics fit better for 
top-N recommendation, and others for prediction [13]. We decided to use a simple 
metric to evaluate our system, in order to have clear preliminary results easy to 
understand. We basically used the accuracy computed as the square root of the 
averaged squared difference between each prediction and the actual rating (the 
root mean squared error or "RMSE" ) . 

Let pik denote the predictions generated by a certain algorithm for a set with 
fc = l,2,...,|fc|, and let the actual ratings provided by a certain user be denoted 
by ra lk (k = 1, 2, |fc|). RMSE is defined by: 



R\-l 




Phk = ^ 

where A is the normalization term, computed as 



(3) 





RMSE 




In our tests, we round the pik values to the closest integer number, because 
the real ratings are integers. As mentioned above, the evaluation of RMSE is 
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Algoritm: RecSys-to-IR 

Input: data set D, active user k, movie h, IR algorithm 
Output: prediction p hk 

1 for each i € I 

2 do 

3 m <- 

4 for each j € U\j k 

5 do 

6 if (Dij ■ D ik = 0) 

7 then 

8 WUij <- 

9 else 

10 WUn <- 1 - ^ElizEihi 

11 m <— rii + 1 

12 end if 

13 end for 

14 end for 

15 for each i € I 

16 do 

17 if (n; = 0) 

18 then 

19 Wt/ifc 

20 else 

21 WUi* ^- log(^) 

22 end if 

23 end for 

24 Call the IR algorithm, and get the ranking list R 

25 p hk <- Round (^2 ■ ^ (^ml^ll ^ 

26 return p^fe 



Fig. 1: The prediction algorithm. 



typically performed using the "leave-n-out" approach [14], where a part of the 
dataset is hidden and the rest is used as a training set for the Recommender 
Systems, which tries to predict properly the withheld ratings. 

We calculate the predictions with LSPR and the basic vector space algorithms 
using data from the training set and we compare the prediction against the real 
rating in the test set. As a benchmark to evaluate the algorithm we employ the 
community average for a certain item, with the aim of measuring how much 
our algorithm can improve the simple community recommendation. Thus, we 
also compute the RMSE of the community recommendation with respect to the 
actual ratings provided by the users. 
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The results reported in table 1 refer to the analysis we performed using the 
dataset from GroupLens 3 described above. We used five couples (training set, 
test set) which share the same composition (80%/20% spilts of the orginal data 
into training and test data) as suggested by the guidelines of GroupLens itself. 



set 


LSPR 


vector space 


community 


1 


0.985 


0.989 


1.073 


2 


0.974 


0.984 


1.067 


3 


0.971 


0.980 


1.060 


4 


0.967 


0.979 


1.056 


5 


0.975 


0.984 


1.065 



Table 1: RMSE 



In table 2 we express this result in relative terms by providing the rate of 
improvement with respect to the average of the ratings by the community: LSPR 
overcomes the community by 8.4% on average, while the vector space model 
decreases the RMSE by 7.6%. 



set 


Improvement LSPR 


Inprovement vector space 


1 


8.2 % 


7.8 % 


2 


8.7 % 


7.8 % 


3 


8.4 % 


7.5 % 


4 


8.4 % 


7.3 % 


5 


8.5 % 


7.6 % 


mean 


8.4 % 


7.6 % 



Table 2: Improvement over community average. 



5 Discussion and conclusions 

A first consideration is that the algorithm already outperforms the community 
even if the gap is not prodigious. As a matter of fact, other algorithms recently 
proposed by different authors, like [15, 16], show RMSE values in the range 0.88 
- 0.95 for the same dataset. However, the aim of this paper is to show that our 
new approach could be a basis for a more sophisticated algorithm, rather than 
presenting an algorithm already comparable with the state-of-the-art. 

So, first of all, we plan to work on fine-tuning our algorithm, to extend this 
empirical evaluation and to compare it with some well known algorithms such 
as KNN or Slope-one [17]. 



3 The dataset can be found on http://www.grouplens.org. 
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We also plan to perform further experiments with a larger number of datasets 
and with a finer grain analysis of sensitivity of the algorithm with respect to the 
size of the data set. In parallel, we are also working on LSPR to improve its 
scalability and to include a number of optimization techniques. 

Moreover, we plan to use other weighting schemes, as the well-known Okapi 
BM25, and other IR algorithms, for example probabilistic models like Terrier 
[18, 19]. The final aim of this work is to merge the results to obtain better 
performances, as already done in Information Retrieval [20]. 

Basically, we are still trying to better understand if our approach can pro- 
vide a nice outcome in the Recommender Systems field and we consider the 
present work as a first answer, so our contribution is an experimental investiga- 
tion of some possible relations between Information Retrieval and Recommender 
Systems. In this sense it seems very interesting that the results obtained with 
LSPR are better than the ones obtained with the vector space model, reflecting 
the behaviour of these models in the IR field, as reported in [10]. 
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